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CHAPTER  I 


INTRODUCTION 

A  BCH  code  is  a  linear  cyclic  block  code  with  a  generator  polynomial,  g{x), 
that  is  a  product  of  minimal  polynomials,  where  the  2t  designed  zeros  of  g(x)  have 
contiguous  powers  of  /5.  Note  that  t  is  the  designed  number  of  correctable  errors, 
do  =  2t  +  1  is  the  designed  distance,  /3  =  a*’,  a  is  a  primitive  element  of  the  field 
GF(Q),  b  =  (Q  —  l)/n,  and  n  is  the  length  of  a  channel  codeword.  Often  the 
true  minimum  distance  of  the  BCH  code  is  larger  than  do.  A  decoder  based  on  the 
Berlekamp-Massey  Algorithm  (BMA)  makes  use  of  the  designed  distance  only  [1], 
however  the  Fundamental  Iterative  Algorithm  (FIA)  can  be  used  to  exploit  some  or 
all  of  the  code’s  imused  distsince,  in  many  cases  increasing  the  number  of  correctable 
errors  [3]. 

Often,  the  critical  issue  is  not  the  number  of  correctable  errors,  but  the  per¬ 
formance  gain/hardware  complexity  ratio.  Since  hardware  complexity  is  difficult  to 
measure,  this  thesis  describes  and  compares  hardware  realizations  for  both  decoding 
algorithms  in  light  of  their  corresponding  performance  gains. 
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CHAPTER  II 


DECODING  ALGORITHMS 
The  Berlekamp>-Massey  Algorithm 

Given  a  BCH  code  over  GF{q)  with  a  GF{q”')  error  locator  field,  the  generator 
polynomial  is  g{x)  =  LG M[mi{x),Tn2{x),  ...,Tn2t{x)],  where  mi{x)  is  the  minimal 
polynomial  for  for  i  =  1, 2, 2f.  Note  that  0  <  jo  <  q”"  -  2,  P  =  a  is  a 

primitive  element  of  GF(q"^),  b  =  (g"*  —  l)/n,  and  n  is  the  number  of  code  symbols  in 
a  codeword.  Considering  the  k  coefficients  of  i{x)  as  an  information  block,  a  channel 
codeword  is  obtained  by  polynomial  multiplication 

c{x)  =  i{x)g{x), 

where  deg{i(rc)}  =  A:  -  1,  deg{p(a;)}  =  n-k,  deg{c(x)}  =  n  -  1,  and  A:  <  n. 

The  channel  errors  can  be  expressed  as 

V 

i=i 

where  i/  is  the  number  of  errors,  Ij  is  the  j*'^  error  position,  and  ei^  is  the  corresponding 
error  weight.  Assuming  the  n  symbols  of  the  received  codeword  are  the  coefficients 
of  v{x), 

v{x)  =  c(x)  +e(x), 
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since  the  channel  errors  are  modeled  to  be  additive.  The  syndrome  values  can  be 
calculated  as 

for  i  =  1, 2, 2t.  By  defining  Yj  =  Ci-  and  Xj  -  /3'j  for  j  =  1, 2, u,  the  syndrome 
equation  can  be  rewritten  as 

j=\ 

for  i  =  1,2,  ...,2t.  Therefore,  there  are  2t  non-linear  simultaneous  equations  that 
relate  the  channel  errors  to  the  syndromes.  An  error  locator  polynomial  has  roots 
XJ^,  for  j  =  1,2,.,.,!^,  and  can  be  used  with  a  little  deduction  to  reveal  the  error- 
syndrome  relationship 

A(x)  —  Kv’x''  A^_i  •  x''~^  -I- ...  -f  Ai  •  X  -f  1  =  (1  —  X  •  .?Ci)(l  —  X  •  X2)...(l  —  x  •  X^), 

YjX}-^'' .  A{Xj^)  =  YjXi+'' .  (A,. .  XT’'  -p  A,._i  •  Xr''+^  +  ...  +  Ai  •  X-^  +  1)  =  0, 

Yj .  (A, .  X}  +  A,-i  •  x}+^  4- ...  +  Ai .  -p  =  0, 

Yi  yj  •  (A.  •  X}  +  A,_i  •  '  -p ...  -p  Ai  •  -p  a:;+')  =  o, 

3=1 

A,  ■£  YiX)  +  K-ijz  W  +  -  +  Ai  E  nA:r‘"'  +  E  =  o. 

i=l  j=l  j=\  i=i 


Assruning  jo  =  1, 
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Au  •  Si  +  A„-i  •  5i+i  + ...  +  Ai  •  S„^i-i  +  Su+i  =  0, 


Ai  •  Su+i-i  +  ...  4-  Aj/_i  •  iSi+i  +  A„  •  Si  =  —S, 


u+ii 


for  i  —  1,2, ...,  2t.  All  2t  equations  can  be  expressed  at  once  with  the  following  matrix 
equation 


“ 

- 

...  5i/_j  5j/ 

A. 

~SI/+1 

^2  53 

...  Su  5^+1 

• 

A^_i 

= 

—  Sv+2 

5|/  5t/+i 

...  S2U-2  Siu-i 

- 1 

*  c 

-52. 

Assiiming  that  v  <  t,  the  BMA  can  be  used  to  find  the  coefficients  of  the 
error  locator  polynomial  A(a;)  [1],  The  initialization  of  the  BMA  depends  on  the  first 
syndrome.  If  Si  ^  0,  A(“)(a;)  =  1,  A(^)(x)  =  1  -  S'jX,  L  =  1,  m  =  1,  Am  =  Si, 
and  r  =  2;  otherwise,  A(-^)(a;)  =  A(®)(x)  =  A(^>(a;)  =  1,  L  =  0,  m  =  0,  Am  =  1, 
and  r  =  2.  A(’'^(a;)  is  the  current  approximation  of  the  error  locator  polynomial,  L  is 
the  stored  register  length,  m  is  the  stored  stage  number.  Am  is  the  stored  syndrome 
error,  r  is  the  ciurrent  stage  number,  5'  is  the  approximation  of  the  r*'*  syndrome,  and 
Ar  is  the  current  syndrome  error.  The  following  pseudo  code  describes  the  inductive 
loop  of  the  BMA: 
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while  (r  <  2t) 

{ 

Cf  _  •^n-l 

Of  —  —  2^j=:i  Of^j 

Ar  =  Sr-  S'r 
if  (Ar  ^  0) 

{ 

A  —  —Ar. 

~  Am 
l  =  T  —  m 

A^^\x)  =  +  Aa;'A^"‘“^)(a;) 

if  (deg{A(’')(x)}  >  L) 

{ 

L  =  deg{A^’')(a;)} 
m  =  r 
Am  —  Ar 

} 

} 

r  =  r  +  1 
}• 

\(P)(^x)  is  the  error  locator  polynomial  A(x),  and  L  is  the  number  of  received  symbol 
errors  i/.  If  deg{A(x)}  ^  L,  a  decoding  failure  has  occurred  [1]. 

A  Chien  search  is  an  evaluation  of  a  polynomial  at  all  possible  values  of  /3®, 
where  x  is  arbitrary.  This  search  can  be  used  to  find  the  u  zeros  of  A{x).  The  Forney 
Algorithm  [1]  can  then  be  used  to  find  the  error  weights 

"  A>{p-hy 
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where 

^(a:)  =  5'(a;)A(a;)(moda;^), 
s(x)  = 

A'W  =  E 

t=l 

Now,  e(x)  is  fully  specified  and 

c(x)  =  v(x)  —  e(x), 
i(x)  =  c(x)/g(x). 

For  the  (17,9)  BCH  code,  the  error  locator  field  is  GF(2*),  6  =  15,  /3  = 
g(x)  =  (x-/3)‘(x-  /3^)  ■{x-/3*)‘{x-  f^)  •  (x  -  ^)  ■  (x  -  •  (x  -  •  (x  - 

and  the  syndromes  are  calculated  as  follows,  assuming  jo  =  1, 

51  =  v{p), 

52  =  v{0^)  =  {S,f. 

This  last  equation  can  be  proven  by  considering  a  generic  polynomial  defined  over 
GF{q),  where  q  is  the  characteristic  of  the  field  GF{Q), 

fi^)  =  Ylfi’ 

t=0 


El 

j=l 


Aix(‘-^). 


If  the  argument  of  / (x)  is  raised  to  the  q  power. 
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i=0 

Since  fi  €  GF{q),  fi  =  f-.  Therefore  [1], 

fi^^)  =  E  f!  ■  =  f  E  fi  ■  =  imY  ■ 

t=0  Li=0 

For  all  single-error  correcting  binary  BCH  codes,  the  assumption  that  jo  =  1 
forces  the  Berlekamp-Massey  Algorithm  to  degenerate  into  a  mere  comparison.  If 
=  0,  then  A{x)  —  1  and  L  —  0.  Therefore,  the  received  codeword  has  no  errors, 
or  a  decoding  error  has  occurred.  In  this  case,  the  received  codeword  is  passed  as 
the  decoded  codeword.  If  Si  ^  0,  then  A(a;)  =  1  —  •  a;  and  L  =  1.  Therefore, 

the  received  codeword  has  a  single  error,  or  a  decoding  error  has  occurred.  For  this 
case,  the  zero  of  the  error  locator  polynomial  is  extracted  by  simply  noting  that  for 
A(a;)  =  0,  a;  =  1/Si.  Next,  the  error  location  is  given  by  li  =  log^(S'i),  and  the  li  bit 
of  the  received  codeword  is  complemented.  These  two  cases  constitute  all  possibilities 
since  S2  =  (Si)^.  Therefore,  decoding  failures  are  impossible. 

The  Fundamental  Iterative  Algorithm 

The  FIA  uses  some  or  all  the  zeros  of  g{x)  that  the  BMA  does  not,  to  try 
to  increase  the  decoded  distance  of  the  code.  Given  the  same  received  block  v(x) 
discussed  for  the  BMA,  the  syndromes  for  the  matrix  E  can  be  calculated  such  that 
Si  =  v(^),  where 
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5a 

5o+jiCi 

... 

Sa+ij/Ci 

5a+c2 

5o+tlCl+C2 

... 

*So+tj/Ci+C2 

Sa+(t'-l)c2  Sa+iici+(t'-l)c2  •••  ‘5o+ij/ci+(t'-l)c2  J 

Note  that  tf  is  the  FIA’s  number  of  correctable  errors,  as  opposed  to  t,  which  is 
the  BMA’s  number  of  correctable  errors.  The  variables  a,  ci,  c^,  ii,  i2,  ...  ,  if  are 
code-specific  and  have  permanent  values  for  each  code,  where  0  <  <  ...  <  if. 
Maximizing  t',  these  permanent  values  are  deduced  by  trial  and  error  using  the  whole 
set  or  a  subset  of  the  zeros  of  g(x)  [3]. 

Since  E  possesses  recurrent  rows  (i.e.  rows  equally  spaced  by  an  index  of  C2), 
the  decoder  algorithm  complexity  is  0(n^).  With  Cj  =  C2  =  1  and  ij^i  =  ij  +  1,  for 
all  j  6  0, ...,  —  1,  where  io  =  0,  the  FI  A  degenerates  into  the  BMA  since  the  entries 

of  E  constitute  a  single  set  of  2t  known  consecutive  syndromes  [3]. 

Given  E,  if  the  number  of  symbol  errors  in  the  received  codeword  is  less  than 
or  equal  to  t',  then  the  FI  A  can  be  used  to  find  the  coefficient  polynomial  ‘ip{x)  and 
the  rank  of  E,  A.  If  the  FIA  degenerates  into  the  BMA,  ‘iIj{x)  =  A(x).  Otherwise,  the 
relationship  of  ^(x)  and  A(x)  is  not  as  clear.  In  general, 

/(^)  = 

j=0 


where  io  =  0.  Also, 


9 


f{x)  =  h{x)  ■  A{l/x)  , 

where  h{x)  E  GF{q^)[x\.  Therefore,  i){x)  and  A(x)  have  a  relationship  that  is  clouded 
by  h{x),  which  is  codeword-dependent  [3]. 

To  initialize  the  FIA,  the  discrepancy  list  is  set  to  zero  {D  =  [0,0,  ...,0]),  the 
discrepancy  row  position  list  is  set  to  zero  [u  =  [0,0,  ...,0]),  the  column  position  is 
set  to  1  (s  =  1),  the  row  position  is  set  to  1  (r  =  1),  and  tp{x)  =  1.  The  following 
pseudo  code  describes  the  inductive  section  of  the  FIA,  where  dr,s  is  the  discrepancy 
for  the  current  row  and  column  and  =  1-1- —r.t  ’  [3])  [4]: 

label  1; 

dr,s  =  [^{x)  •  =  n=o  V’i  • 

label  2; 

if  {dr,s  =  0) 

{ 

if  (r  =  If) 

{ 

X  =  s-1 
END 
} 

else 

{ 

r  =  r  -f  1 
goto  label  1 

} 


else 
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{ 

if  {Dr  =  0) 

{ 

Dr  =  dr,s 
Ur  =  S 

=  ‘tp{x) 

S  —  S  +  1 

r  =  1 

goto  label  1 

} 

else 

{ 

‘ip{x)  =  tl){x)  —  ^  •  x^~^ 

dr, 8  =  0 

goto  label  2 

} 

}• 

With  '4>{x)  and  A  known,  the  syndrome  dependence  polynomial  f{x)  can  be 
constructed.  Since  K{l/x)  has  zeros  of  the  form  where  cj  is  the  position  of  the 
potential  error,  a  Chien  search  can  be  used  on  f{x)  to  eliminate  as  many  zeros  of 
h{x)  as  possible.  All  the  zeros  found  with  the  Chien  search  are  considered  members 
of  the  set  U,  where  \U\  =  S.  Using  these  members,  a  polynomial  similar  to  A(x)  can 
be  constructed 

r(x)  =  fl(i-x-u,). 

1=1 

Some  zeros  of  h(x)  might  be  used  in  building  r(a;).  Otherwise,  r(a;)  =  A(a;),  which  is 
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capable  of  revealing  If  or  less  error  locations,  as  opposed  to  the  BMA’s  A{x),  which 
can  only  reveal  t  or  less  error  locations  [3]. 

Given  the  largest  set  of  known  consecutive  syndromes  the  Forney  Algorithm 
requires  |^|  >  S  or  more  specifically  |^|  =  max{(5}.  For  all  cases  where  E  has  entries 
that  form  two  or  more  distinct  sequences  of  known  consecutive  syndromes  (i.e.  multi¬ 
sequence  cases),  1^1  <  2t'  -  1.  Therefore,  if  6  >  2t',  the  number  of  known  consecutive 
syndromes  must  be  increased.  These  extra  syndromes  may  or  may  not  be  foimd  using 
the  n  syndrome  linear  dependence  equations  or  “patch”  equations  described  by 


"h  /l^o+»A-iCl+ic2  "h  •••  “h  fx^a+jc2  0, 


where  je0,...,n— 1.  Only  t'  of  these  equations  are  necessary,  one  for  each  possible 
non-zero  value  of  X.  The  value  of  j  for  each  equation  can  be  found  by  trial  and  error. 
If  the  extra  syndromes  cannot  be  foxmd,  the  FIA  will  not  be  able  to  correct  t'  errors 
even  though  Eis  a  tf  xt'  +  1  matrix  of  the  form  described  previously  [3]. 

Assuming  |^|  >  an  adapted  form  of  the  Forney  Algorithm  can  be  used  to 


find  the  error  weights 


Q{x)  =  S'(x)r(a:)(modrE'^l), 
1(1 

S(x)  = 

J=1 


where 
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r(x) 


s 

=  E 


t=i 


El 

i=i 


TiX^ 


(i-i) 


Therefore,  e(x)  is  fully  specified  by 


e(^)  =  El^^i 

i=i 

and, 


c{x)  =  t;(x)  —  e(a:), 
i{x)  =  c(a:)/p(a;). 


For  the  (17, 9)  code,  tf  =  2.  The  syndrome  matrix  E  is 

51  Sg  Si6 

52  Sg  Sie 

where  a  =  1,  ci  =  1,  C2  =  1,  i\  =  7,  and  ^2  =  14.  Since  <5  <  8,  |^|  needs  to  be  8,  so 
the  following  patch  equations  were  foimd  by  trial-and-error  by  evaluating  the  linear 
dependence  equation  at  j  =  0,  ...,n  —  1: 

‘S'ls  ’  V'l  +  S's  =  0,  for  A  =  1  and  j  =  14, 

■  Sg  - '4)2  +  Si^  ''4>i+  Sg  =  0,  for  A  =  2  and  j  =  7. 

For  j  =  t', n  —  1,  the  equation  does  not  contain  syndromes  from  a  particular  row  in 
E.  These  syndromes  are  contained  in  a  particular  extension  row  of  E.  For  example, 
Sg,  Sig,  and  Sg  comprise  the  sixth  extension  row  of  E,  which  corresponds  to  j  =  7. 
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For  j  =  14,  S'ls  and  comprise  the  first  two  elements  of  the  thirteenth  extension 
row.  Note  that  S2,  1S4,  Ss,  Sg,  S13,  Si^,  and  iS'ie  are  all  known.  If  S3,  S5,  Sq,  and 
S7,  could  be  foimd,  |^|  =  8.  Therefore,  for  each  possible  value  of  A,  the  syndrome 
dependence  equation  was  examined  for  each  value  of  j,  imtil  it  became  an  equation 
with  a  single-term  unknown  that  was  in  the  conjugate  set  containing  S3,  S5,  Se,  S7, 
Sio,  Su,  S12,  and  Su. 

The  generator  polynomial  is  the  same  as  the  that  for  the  BMA,  but  the  fol¬ 
lowing  version  is  multiplied  out  to  show  the  coefficients  in  GF{2) 


g{x)  =  +  X  +  1. 


As  an  example,  assume  that  the  information  polynomial  is  i{x)  =  1  and  the  error 
polynomial  is  e{x)  =  x*  -t-  1.  Then,  the  codeword  polynomial  is  c{x)  =  x^  +  x"^  + 
X®  -t-  -f  -I-  a;  -1- 1,  and  the  received  polynomial  is  t;(a;)  =  a;’^  -I-  a;®  -1-  a;^  4-  -I-  x. 

Calculating  syndrome  Si  and  using  the  relationships  S2  =  (‘S'l)^,  Ss  =  (5'2)^,  iS'ie  = 
(Ss)^,  Si5  =  (Sie)^,  and  Sg  =  {S15)*  gives 


a® 


^18  ^132  ^144 


Now,  'ip{x)  and  A  can  be  found  with  the  FIA: 


Initialization: 

D  =  [0,0];u  =  [0,0];s  =  l;r  =  l-,tP{x)  =  1 
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Loop  1: 

di,i  =  rpo  ■  Hi,i  =  O'® 

D  =  [a®,  0];  u  =  [1, 0];  =  1;  s  =  2;  r  =  1 

Loop  2: 

<^1,2  =  V’o  '  “1,2  +  V’l  •  “1,1  = 

'ip(x)  =  xp(x)  —  ^  •  a:  =  1  +  a^^x;  r  =  2 

Loop  3: 

^2,2  =  V’o  *  “2,2  +  V’l  •  “2,1  = 

D  =  [a®,  u  =  [1, 2];  '4)^'^\x)  =  1  +  a^x\  s  =  3;  r  =  1 
Loop  4: 

d\,z  =  V’o  •  Hi, 3  +  xj)i  •  El, 2  +  ^*2  •  Hi,i  = 

V'(x)  =  i’ix)  —  ^  •  ij}^^^(x)  ■  x^  =  1  +  a^x  +  r  =  2 

Loop  5: 

^2,3  =  V'o  ■  “2,3  +  ^1  •  ■=^,2  +  ^2  •  “2,1  = 

‘ip{x)  —  xl){x)  —  ^  •  X  =  1  +  +  oi’^x'^ 

X  =  2 

This  xl}{x)  and  A  yield  f{x)  =  x***  +  +  a^®.  Using  a  Chien  search,  the  zeros  of 

f{x)  are  a®'^®  and  a®’^®;  therefore,  r(x)  =  a^^°x^  +  a^x  +  1.  For  this  code,  |^|  =  8. 
Also,  for  this  example,  A  =  2.  Therefore,  Ss,  Sq,  and  Sj  must  be  determined 
using  the  A  =  2  patch  equation  in  order  to  have  8  known  consecutive  syndromes.  All 
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four  imknown  syndromes  are  in  the  same  conjugate  set,  so  1S5  can  be  deduced  from 
the  patch  equation  and  then  the  following  equations  can  be  used  to  find  S3,  Se,  and 

57: 

{S,)‘  =  S3, 

(S3)"  =  S3, 

(Sc)"  =  S,. 

It  may  be  confusing  why  |^|  =  max{5}  =  8.  This  value  is  determined  by  constructing 
a  FIA  decoder  in  software.  This  software  is  used  to  decode  all  possible  error  pat¬ 
terns  containing  tf  or  less  errors  starting  with  |^|  =  and  increasing  |^|  after  each 
attempted  verification  imtil  the  decoder  corrects  all  of  these  error  patterns.  This 
process  finds  the  minimum  value  of  |^|  required  to  properly  decode  t'  or  less  errors. 

With  S\  through  S3  and  r(x)  known,  the  Forney  algorithm  can  be  used  to 
identify  which  zeros  of  / (x)  of  the  form  correspond  to  actual  error  locations.  For 
this  example,  both  zeros  are  identified,  but  there  are  many  cases  where  the  identified 
zeros  are  a  subset  of  the  zeros  found  with  the  Chien  search.  With  the  two  zeros 
identified,  e(x)  =  x*  +  1,  which  is  the  correct  error  pattern. 

The  relation  between  the  FIA  and  the  BMA  is  not  easily  seen;  however,  there 
is  one.  Starting  with  column  one,  the  FIA  uses  a  trimcated  BMA;  it  starts  out  like 
the  BMA  but  stops  if  a  discrepancy,  dr,«,  is  non-zero.  This  discrepancy  and  its  column 
number  are  stored  as  Dr  and  Uj.  along  with  the  current  approximation  of  the  coefficient 
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polynomial,  '4>{x),  which  is  stored  as  For  the  other  columns,  the  initialization 

is  dependent  upon  all  previous  columns;  the  procedure  is  similar  to  the  procedure  for 
the  first  column,  except  when  a  non-zero  discrepancy  is  encoimtered,  the  trimcated 
BMA  will  stop  only  if  no  discrepancy  has  been  recorded  for  that  particular  row,  in 
essence,  if  Dr  =  Q.  If  dr, a  ^  0  and  Dr  ^  0,  ip{x)  is  updated.  The  algorithm  continues 
this  pattern  for  each  column  until  it  is  able  to  perform  the  full  BMA  on  a  column.  It 
is  possible  that  the  full  BMA  might  be  performed  on  the  first  column,  but  this  would 
indicate  that  no  errors  could  be  foimd. 


CHAPTER  III 


PERFORMANCE  COMPARISONS 

Since  theoretical  performance  measures  of  BCH  codes  are  inexact,  simulation 
was  necessary  to  empirically  measure  performance.  Therefore,  a  baseband  digital 
communication  system  using  BMA  and  FIA  decoders  was  programmed  on  a  computer. 
This  simulated  system  was  used  to  approximate  the  bit  error  rate  (BER)  at  the 
decoder  output  versus  SNR  curve  for  both  the  BMA  and  the  FIA.  This  curve  is 
important  since  it  gives  insight  into  the  system  coding  gain,  which  is  an  important 
performance  indicator. 

Let  Pu{h)  be  the  BER  for  the  imcoded  system  operating  at  the  SNR  h,  and 
let  Pc(h)  be  defined  in  a  similar  way  for  the  coded  system.  Then,  the  coding  gain  is 
defined  as 


9{p)  =Pc\p)-Pu\p), 

where  p  is  the  BER  and  all  SNR’s  are  in  dB.  Therefore,  the  coding  gain  at  a  given  BER 
is  the  difference  in  SNR  of  the  imcoded  and  coded  systems  when  each  are  operating 
at  a  specific  BER  [11]. 
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The  Simulated  System 
This  statistical  computer  simulation  entails: 

1.  Generating  an  X  bit  maximum  entropy  encoder  input  sequence, 

2.  Encoding  in  k  symbol  blocks  at  a  k/n  input/output  rate,  which  involves  con¬ 
structing  an  n-length  channel  codeword  from  a  fc-length  source  codeword,  where 
each  component  is  a  binary  symbol, 

3.  Generating  an  effective  channel  error  sequence  of  length  Xn/k  such  that  the 
errors  are  exponentially  spaced.  The  probability  of  an  error  is  approximately 
the  BER  of  the  coded  channel,  and  the  number  of  decoder  bit  errors  x  is  large 
(greater  than  100)  [2], 

4.  Adding  the  error  sequence  to  the  encoded  sequence  bit  by  bit  in  GF{2), 

5.  Decoding  in  n  symbol  blocks  at  an  n/k  input/output  rate. 

It  is  assiuned  that  somce  encoding,  if  necessary,  heis  been  applied  so  that  the 
binary  symbols  at  the  encoder  input  are  independent  and  equally  likely.  This  input 
can  be  modeled  by  a  pseudo-noise  (PN)  sequence.  A  PN  sequence  is  periodic  with  a 
maximum  period  of  =  2''  —  1,  where  r  is  the  number  of  stages  in  the  generating 
shift  register  [10].  In  many  cases,  tap  weights  can  be  selected  so  that  the  maximum 
period  is  achieved,  resulting  in  a  maximal-length  sequence  [10].  Each  maximal-length 
sequence  has  at  least  three  non-zero  taps.  Those  sequences  with  only  three  taps  are 
the  most  time  efficient,  since  the  generation  speed  increases  as  the  number  of  non-zero 
taps  decrease.  The  periodic  autocorrelation  Rp{u)  is  defined  as  {Nq  —  Ni)/{Nq -|- Ni), 
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where  p  is  a  sequence  with  period  N  =  Nq  +  Ni ,  s  is  the  niodulo-2  addition  of  p  and 
its  u  position  cyclic  shift,  No  is  the  number  of  O’s  in  s,  and  Ni  is  the  number  of  I’s 
in  s.  Any  modulo-2  addition  of  a  maximal-length  sequence  with  its  u  position  cyclic 
shift  is  the  u'  position  cyclic  shift  of  the  maximal-length  sequence.  Also,  the  niunber 
of  1  ’s  in  a  maximal-length  sequence  is  one  greater  than  the  number  of  0 ’s;  therefore, 
the  periodic  autocorrelation  function  of  any  maximal-length  sequence  is  [10] 

{1.0  ii  u  =  IN 
if  uj^lN 

where  I  E  I,  As  r  approaches  oo,  N  approaches  oo,  and  the  correlation  dwindles  to 

0. 

For  a  simulation,  =  2®  —  1,  where  x  —  \l0g2X]  and  X  is  the  number  of 
input  bits  required  by  the  encoder  for  an  entire  simulation.  X  is  difficult  to  determine, 
since  the  calculation  of  X  is  based  on  the  coded  BER  p 

X  = 

pn  ’ 

where  x  is  the  number  of  bit  errors  at  the  output  of  the  decoder.  The  BER  at  the 
decoder  output  is  actually  a  random  variable  (RV),  which  can  have  a  multimodal 
distribution  if  X  is  not  large  enough.  The  goal  is  to  choose  X  so  that  x  is  large 
(greater  than  100)  [2]. 

The  probability  of  a  bit  error  at  the  output  of  the  raw  channel  when  an  encoder 
and  decoder  are  present  is  the  BER  of  the  coded  cheinnel  'ipQ.  If  >  10~^,  the  effective 
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channel  error  sequence  can  be  constructed  by  thresholding  successive  outputs  of  a 
imiform  pseudo  random  number  with  ‘4)^.  If  the  output  is  less  than  or  equal  to  V’oi  a 
1  is  declared;  otherwise,  a  0  is  declared.  If  <  10“^,  the  following  approach  is  used. 
The  location  of  the  first  error  is  calculated  by  [6] 


where  As  is  the  number  of  zero  sequence  elements  imtil  an  error  occurs  and  «  is  a 
uniform  pseudo  random  mmiber.  Subsequent  error  locations  can  be  determined  by 
using  the  same  equation  where  As  is  the  number  of  zero  sequence  elements  until  the 
next  error  and  u  is  a  new  uniform  pseudo  random  munber  [6]. 


The  Results 

For  the  (17,9)  BCH  code  with  g{x)  =  m\{x)  and  a  minimiun  distance  d  =  5, 
the  exploited  distance  is  4  for  the  BMA  and  5  for  the  FIA,  yielding  t  =  l  and  t'  =  2, 
respectively.  Figure  3.1  shows  the  corresponding  theoretical  performance  curves  of 
error  probability  versus  Eh/No.  The  theoretical  BER  at  the  output  of  the  decoder  is 
given  by  [7] 


n 

P(bit  error  at  decoder  output)  =  ^ 

j=t+i 


( 

\ 

i 

n 

n 

/ 

.(1-p)”-^- 


J 


where 
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P(j  errors  in  codeword)  = 


/  \ 
n 


■P^  •  (1  -P)"  ^ 


P(bit  error  |  j  errors  in  a  codeword)  =  — . 

Note  that  for  the  FIA  the  summation  actually  ranges  from  +  1  to  n.  For  the 
theoretical  plots,  the  FIA  coding  gain  is  positive  for  all  SNR’s  above  3.2dB,  and  the 
BMA  coding  gain  is  positive  for  all  SNR’s  above  lldB.  Also,  the  FIA  net  coding  gain 
is  1.5dB  and  l.SdB  for  system  BER’s  10"®  and  10"®,  respectively,  whereas  the  BMA 
net  coding  gain  is  small  for  both  these  system  BER’s. 

To  validate  the  simulation,  decoding  errors  (those  errors  that  give  rise  to  unde¬ 
tected  word  errors)  were  disabled  by  giving  the  decoder  knowledge  of  the  transmitted 
codewords.  Figures  3.2  and  3.3  compare  the  simulation  results  and  the  theoretical 
curves  for  the  BMA  and  FIA,  respectively.  Each  simulation  curve  coincides  with  its 
corresponding  theoretical  curve.  Therefore,  both  simulation  models  are  considered 
valid. 

Next,  both  simulations  were  performed  without  restriction.  Figures  3.4  and 
3.5  compare  the  results  with  the  theoretical  cvuves  and  the  Torrieri  Bounds  [8].  The 
Torrieri  Bounds  assiune  that  the  number  of  bit  errors  in  a  codeword  has  a  binomial 
distribution  (independent  bit  errors).  If  a  codeword  has  t  errors  or  less  for  the  BMA  or 
t'  errors  or  less  for  the  FIA,  the  codeword  is  assumed  to  be  fully  correctable;  otherwise. 
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the  decoder  is  assumed  to  add  (worst  case)  t  errors  for  the  BMA  or  t'  errors  for  the 
FIA  to  the  codeword  for  the  upperbound  calculation,  and  subtract  (best  case)  t  errors 
for  the  BMA  or  t!  errors  for  the  FIA  for  the  lowerbound  calculation.  For  both  the 
BMA  and  FIA  simulations,  the  simulation  curves  were  just  above  the  theoretical 
curves,  indicating  that  in  the  event  of  a  decoding  error,  the  decoder  added  errors  to 
the  received  codeword  slightly  more  often  than  it  subtracted  errors  [8].  Figure  3.6 
shows  the  simulation  performance  curves  for  both  the  BMA  and  the  FIA.  At  a  BER 
of  10  the  BMA  coding  gain  was  -.25dB,  and  the  FIA  coding  gain  was  1.4dB.  The 
10  *  BER  coding  gains  are  not  available  since  the  simulation  was  not  performed  for 
high  SNR  due  to  the  exponential  growth  in  required  execution  time. 
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Figure  3.1.  THEORETICAL  PERFORMANCE  CURVES  FOR  (17,9)  BCH  CODE 


Figure  3.2.  BMA  STATISTICAL  TEST  WITH  DECODING  ERRORS  DISABLED 
FOR  (17,9)  BCH  CODE 
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Figure  3.4.  BMA  STATISTICAL  TEST  WITH  NO  RESTRICTIONS  FOR  (17,9)  BCF 


Figure  3.5.  FIA  STATISTICAL  TEST  WITH  NO  RESTRICTIONS  FOR  (17,9)  BCH 


Uncoded  Eb/No  (dB) 


P(uncoded)  P(channel)  ^  P(coded_BMA)  P(coded_FIA) 


Figure  3.6.  SIMULATION  PERFORMANCE  CURVES  FOR  (17,9)  BCH  CODE 


CHAPTER  IV 


HARDWARE  COMPARISONS 

Two  areas  of  comparison  are  examined  in  this  chapter.  First,  the  critical 
path  is  identified.  The  critical  path  is  the  hardware  complexity  of  the  algorithm 
bottleneck;  it  is  measured  with  respect  to  the  processing  of  one  received  bit.  Second, 
hardware  complexity  of  the  entire  algorithm  is  identified.  Hardware  complexity  is 
similar  to  chip  real-estate  analysis  but  is  viewed  at  a  higher  level.  For  this  chapter,  the 
complexity  will  be  measiured  by  the  number  of  GF  additions,  GF  multiplications,  GF 
multiplicative  inverses,  arithmetic  additions,  arithmetic  multiplications,  comparisons, 
temporary  memory  slots,  and  permanent  memory  slots. 

Critical  Path  for  the  BMA 

Figure  4.1  shows  the  high-level  hardware  layout  for  the  entire  decoder.  This 
layout  calculates  the  syndrome  using  Horner’s  rule  [l].  The  latency  of  this  syn¬ 
drome  calculation  is  the  time  required  to  perform  n  —  1  GF  multiplies  and  n  GF 
additions.  The  maximum  latency  of  the  rest  of  the  decoder  is  the  time  require  to  per¬ 
form  n  — 1  GF  multiplies,  1  GF  addition,  1  GF  multiplicative  inverse,  n  — 1  arithmetic 
additions,  and  2n  comparisons.  The  calculation  of  syndromes  is  then  the  bottleneck, 
since  n  —  \  GF  additions  will  most  likely  require  more  time  than  1  GF  multiplicative 
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inverse,  n—1  arithmetic  additions,  and  2n  comparisons.  Therefore,  the  critical  path 
is  1  GF  addition,  1  GF  multiplication,  and  1  memory  element. 

Critical  Path  for  the  FIA 

Figure  4.2  shows  the  high-level  hardware  layout  for  the  entire  decoder.  This 
layout  also  calculates  the  syndrome  S\  using  Homer’s  rule  [1].  The  latency  of  the 
calculation  and  storage  of  syndromes  is  the  time  required  to  perform  (n  +  6)  GF  mul¬ 
tiplies,  n  GF  additions,  plus  RAM  loading.  The  two  syndrome  RAM’s  must  remain 
unchanged  imtil  the  calculation  of  in  the  Forney  Algorithm,  is  completed.  In  a 
worst  case  scenario,  this  storage  time  is  approximately  the  time  required  to  perform 
6  •  n  GF  multiplies,  6  •  n  GF  additions,  3  GF  reciprocals,  3  multiplies,  n  additions, 
and  n  compares,  which  is  much  longer  than  the  latency  time  for  the  calculation  and 
storage  of  syndromes,  assuming  that  the  time  to  load  the  two  RAM’s  is  relatively 
small.  Therefore,  the  critical  path  lies  between  loading  the  RAM  and  completing  the 
calculation  of  Q(x).  Since  the  critical  path  is  measured  with  respect  to  the  processing 
one  received  bit,  the  FIA  has  no  obvious  critical  path,  since  the  bottleneck  carmot  be 
dissected  in  this  way. 

As  a  side  note,  RAM  1  has  six  bytes  of  memory  (8  bits/memory  element)  with 
a  three  bit  addressing  scheme,  and  RAM  2  has  eight  bytes  of  memory  with  a  three 
bit  addressing  scheme.  Both  RAM’s  can  be  concatenated  into  one  14  byte  RAM  with 
a  foiu:  bit  addressing  scheme,  sacrificing  access  time. 
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Complexity  Comparisons 

Table  4.1  compares  the  hardware  complexity  of  the  two  algorithms.  Since 
the  GF  multiply  is  the  most  complex  of  all  the  operations,  assuming  polynomial 
representation  for  the  field  elements,  the  FIA  is  roughly  20  times  more  complex  than 


the  BMA. 

Table  4.1.  HARDWARE  COMPLEXITY  COMPARISON 


Algorithm 

GF  + 

GFx 

GF  Reciprocal 

+ 

X 

Comparison 

BMA 

2 

2 

1 

1 

0 

3 

FIA 

34 

42 

3 

5 

1 

10 

Algorithm 

Temporary  Memory 

Permanent  Memory 

BMA 

37 

0 

FIA 

128 

14 

ffi  §  ©Oh 


Figure  4. 1 .  BMA  HARDWARE  LAYOUT  FOR  A  (1 7,9)  BCH  DECODER 
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Figure  4.2.  FIA  HARDWARE  LAYOUT  FOR  A  (17,9)  BCH  DECODER 


FIGURE  4.2.  (continued) 
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FIGURE  4.2.  (continued) 


CHAPTER  V 


SUMMARY  AND  CONCLUSIONS 

The  BMA  uses  only  the  designed  distance  of  a  BCH  code,  which  is  often 
smaller  than  the  true  minimiun  distance  [1];  however,  the  FIA  can  be  used  to  exploit 
some  or  all  of  the  imused  distance,  in  many  cases  increasing  the  number  of  correctable 
errors  [3].  Although  the  number  of  correctable  errors  is  an  important  performance 
indicator,  a  more  complete  indicator  is  the  performance  gain/hardware  complexity 
ratio.  At  a  BER  of  10~®,  the  performance  gains  are  -.25dB  and  1.4dB  for  the  BMA 
and  FIA,  respectively;  however,  the  hardware  for  the  FIA  decoder  is  roughly  20  times 
more  complex  than  the  hardware  for  the  BMA.  Also,  the  criticad  path  for  the  BMA 
is  1  GF  addition,  1  GF  multiplication,  and  1  memory  element,  whereas  the  FIA  is  so 
complex  that  the  critical  path  not  easily  identified. 
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CHAPTER  VI 


FUTURE  DIRECTIONS 

Since  the  comparison  of  the  BMA  and  FI  A  is  code-specific,  other  codes  with 
higher  designed  distances  might  be  analyzed  to  give  a  more  balanced  comparison. 
Also,  the  Extended  Fimdamental  Iterative  Algorithm  (EFIA)  might  be  analyzed  for 
net  coding  gain  and  hardware  complexity  [4]. 
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